Title 

METHOD AND RELATED APPARATUS FOR REORDERING ACCESS REQUESTS 
USED TO ACCESS MAIN MEMORY OF A DATA PROCESSING SYSTEM 
5 Cross Reference To Related Applications 

The application claims the benefit of U.S. Provisional 
Application No. 60/440, 046, which was filed on 01/15/2003 and 
- entitled "A New Latency-Controlled Reorder Method for 
High-Perf ormance Data Computing System"'. 
10 Background of Invention 

1. Field of the Invention 

r • 

i 

The present invention relates to a method and related 
15 apparatus for accessing a memory device of a computer system. 
In particular, the present invention discloses a method and 
related 'apparatus for reordering access requests used to 
access, the memory device of a computer system. 

20 2. Description of the Prior Art " 

!•' 

i • 
i ' 

Data [processing systems are systems that manipulate, 

i 

process, j and store data, and are well known in the art. 

Personal computer systems and their associated are such data 
25 processing systems. Fig.l is a diagram of a prior art computer 

system 1*0. The computer system 10 comprises a central 

processing unit (CPU) 12, a north bridge circuit 14, a south, 
- bridge circuit 16, a display controller 18, a memory device 

20, an li/0 device 22, and a hard-disk drive 24. The CPU 12 
30 is used to control the operation of the computer system 10. 

The no.rth bridge circuit 14 is used to arbitrate signals 

transmitted between the CPU 12 and high-speed peripheral 



devices such as the display controller 18 .and the memory 
device 20. The display controller 18 is used to handle display 
data, and the memory device 20 is used to process and store 
data. The south bridge circuit 16 is used to arbitrate signals 
5 transmitted between the north bridge circuit 14 and low-speed 
peripheral devices such as the I/O device 22 and the hard-disk 
drive 24!. The I/O device 22 (a keyboard for example) is used 
to receive control signals inputted by an end-user. The • 
hard-disk drive 24 usually is a non-volatile memory device, 
10 while the memory device 20 usually is a volatile memory device. 
The memory device 20 may include a dynamic random access 

memory (DRAM) to store processing programs and data. For 

I , • 

example,- the programs and data stored on the hard-disk drive 

24 are loaded into the memory device 20 so that the CPU 12 

15 is capable of executing the programs more.quickly . Then, the 

data processed' by the CPU 12 are stored back to the memory 

device 20. It is noted that the north bridge circuit 14 has 

a memory controller 26 for controlling the access of the 

memory device 20. When a master device such as the CPU 12 or 

20 the display controller 18 issues access requests to the memory 

i 

controller 26, the memory controller 26 accesses the memory 

device 2;0 according to the received access requests. With an 

in-order access scheme, the memory controller 26 responds to 

the master device according to the order of the received 

25 access requests. For example, the display controller 18 

outputs read requests RA1, RA2, RA3, RA4, and RA5 in an order 

i ■ 
to acquire data Dl, D2, D3, D4, and D5 stored in the memory 

device 20 respectively. Even the memory controller 26 

executes the read requests RA1, RA2, RA3, RA4 , and RA5 out 

30 of its original order, the memory controller 26 will still 

orderly 'transmit the retrieved data Dl, D2, D3, D4, and D5 

to the display controller 18. 



Fig .2 is a diagram showing a first prior art data access 
operation of the memory device 20. For example, the CPU 12 
sequentially produces read requests RA1 , RA2, RBI, RB2 , RA3, 
5 ' RA4, RB3, RB4,*RA5, RA6 for acquiring data DAI, DA2, DB1, DB2, 
DA3, DA4, DB3, DB4, DA5, DA6 , which are stored in the memory 
device 20. It is assumed that data DAI, DA2, DA3, DA4, DAS, 
DA 6 are all stored on page A of the memory, and data DB1, DB2, 
DB3 , DB4 are all stored on page B . As shown in Fig . 2 , the memory 

10 controller 26 executes the read requests RA1 , RA2, RBI, RB2, 
RA3 , RA4 \ RB3, RB4, RA5, RA6 in its original order. The memory 
controller 2 6 executes the read request RA1 to retrieve data 
DAI. Thei memory device 20 transmits the acquired data DAI to 
the memory controller 26 via a memory bus . The memory 

15 controller 26 then transfers the data DAI to- the CPU 12, 

thereby completing the data . retrieval for the read request 
RA1. Since data DA2 called by the next read request RA2 is 
also located on the same page A, the memory device 20 adopts 
a burst mode to access the data DA2 . The data DA2 is^ quickly 

20 retrieved, and is transmitted to the memory controller. 26 via 
the memory bus. The memory controller 26 then transfers the 

i . • . 

data DA2 : to the CPU 12 to finish the data delivery for the 
read request RA2. However, the read request RBI is expected 
to retrieve data DBl that is located. on page B which is 

25 different from page A. In order to access the desired page 
B, the memory device 20; needs to precharge the opened page 
A-first before activating the wanted page B. Therefore, the 
memory controller 26 successively issues a command PreA for 
precharcfjing the opened page A, and issues a command ActB for 

30 activating the wanted page B. After page B is successfully 
opened, /the memory controller 26 is ready to access data 
stored on page B. Read requests RBI, RB2 are sequentially 



executed by the memory controller 2 6 to retrieve 
corresponding data DB1 , DB2. Then, the memory device 20 
provides data DB1, DB2 to the memory controller 26 via the 
memory bus. The memory controller 26 transfers the acquired 
5 data DB1., DB2 to the CPU 12, thereby completing the data 
retrieval for the read requests. RBI, RB2 . As the next command 

i • 

calls for data retrieval from page A, commands PreB, ActA are 
successively issued to precharge the opened page B and 
activate the wanted page A because data DA3 is located on page 

10 A. The above-mentioned process is performed for accessing 
following data DA3, DA4, DB3, DB4, DA5, DA 6 by switching back 
"and forth between page A and B .' As it is shown, when two 
adjacent^ read requests retrieve data located on different 
pages, the memory device 20 has to spend extra time to switch 

15 between pages. Consequently, the precharging operation and 
the activation operation deteriorate the overall performance 
of the memory device 20. As shown in Fig. 2, latency Ll of the 
memory bus is introduced owing to the precharging operation 
and the activation operation. Similarly, latency L2 of the 

20 host bus is also introduced accordingly due to the imposition 

of li. : 

Generally speaking, when a master device such as the CPU 

I - - • 

12 accesses the memory device 20, the precharging operation 

25 and the activation operation worsens the performance of the 
overall data access operation. Therefore, a prior art 
reordering scheme is used to diminish the effect caused by 
the precharging operation and the activation operation. Fig. 3 
is a diagram showing an improved data access operation of the 

30 memory device 20. The CPU 12 sequentially generates read 

requests RA1, RA2, RBI, RB2, RA3, RA4, RB3, RB4, RA5, RA6 for 
acquiring data DAI, DA2, DB1, DB2, DA3, DA4, DB3, DB4, DAS, 



A 



DA 6 stored in the memory device 20. The memory controller 26 
reorders the received read requests RA1, RA2, RBI, RB2, RA3, 
RA4 , RB3, RB4, RA5, RA6 so that the memory controller 26 
executes the read requests RA1, RA2, RA3, RA4 , RA5 , RA6before 
5 it executes RBI, RB2, RB3, RB4, thereby reducing the 'time 
spent on switching between two different pages. As shown in 
Fig. 3, after the read request RA6 is successfully executed, 
commands PreA, ActB are issued by the memory controller 26 
for precharging page A and activating page B. Only one page 
10 switch operation is performed, and only one latency LI is 
introduced for the memory bus. However, the memory controller 

26 has to transfer the retrieved data to the CPU 12 v according 

f 

to the original order defined by the sequence in which 
requests are issued. In other words, the memory controller 

15 2 6 needs to deliver data DAI, DA2, DBl, DB2, DA3, DA4 , DB3, 
DB4, DAS, DA 6 to the CPU 12 sequentially. Even if the memory 
device 20 retrieves data DAI, DA2, DA3, DA4, DA5, DA 6 quickly 
in a burst mode, after delivering DAI and DA2, the host bus 
has to wait until the data DBl has been retrieved by the read 

20 request RBI. It is noted that since DA3, DA4, DA5, DA 6 have 

been successfully retrieved and transferred to the memory 

i 

controller 26, data DA3, DA 4 are immediately transmitted to 
the CPU 12 after data DB2 has been transmitted to CPU 12. As 
such, only one latency L2 is introduced for the host bus. 

25 Comparirig with the first data access operation illustrated 
in Fig-. -2, the data access operation illustrated in Fig. 3. 
needs fewer page switch operations, but with a much greater 
latency ;L2 for the host bus. The performance of the memory 
bus is improved at the cost of the performance of the host 

30 bus. If the read request RBI is delayed significantly due to 
the reordering of the requests, the latency L2 can seriously 
deteriorate the overall data access operation. It is thus also 



hard to benefit from the reordering of the read requests as 
illustrated in Fig. 3. 

Summary of Invention 

5 

It is therefore an objective of this invention to provide 
a method and related apparatus capable of reordering access 

requests used to access a memory device. 

• i • . 

i ' ■ 

10 Briefly summarized, the preferred embodiment of the 

claimed invention discloses a method for accessing a memory 

[ - . 

device of a computer system, the method comprising receiving 
one or more access requests for accessing the memory device 
in a first predetermined order, and reordering the access 

15 requests in a second predetermined order to be processed in 
a request queue by relocating a first access request to follow 
a second access request accessing a same memory page to 
increase processing efficiency. In addition, the relocating 
is prohibited if it increases a processing latency for a third 

20 access request to exceed a predetermined limit. 

The {claimed invention is capable of appropriately 
balancing performance of the memory bus and performance of 
the host bus to optimize overall performance of the data 
25 access operation. 

i ' " . 

I 

! ■ ■ 
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These and other objectives of the present invention will 

i 

no doubt; become obvious to those of ordinary skilled in the 

! 

art aftejr reading the following detailed description of the 
30 preferred embodiments, which are illustrated in the various 
figures 'and drawings. 



Brief Description of Drawings 



Fig.l is a diagram of a prior art computer system. 
Fig. 2 is a diagram showing a first data access operation of 
5 a memory device shown in Fig.l. 

Fig. 3 is. a diagram showing a second data. access operation of 
.the memory device shown in Fig.l. 

Fig. 4 is a block diagram of a memory controller according to 
the present invention. 
10 Fig. 5 is a flow chart showing a reordering process according 

to one example of the present invention. 

( : * 

Fig. 6 is a diagram showing a data access operation of the 

I 

memory device according to the reordering process of Fig. 5. 
15 * Detailed Description 

Fig. 4 is a sample block diagram of a memory controller 30 
according to one example of the present invention. The memory 

20 controller 30 comprises a page/bank comparing unit 32 , a 
latency control uni-t 34, a reorder decision-making unit 36, 
a request queue selection unit 38, a request queue 40, a 
latency monitoring unit 42, and a memory access state machine 
44. In addition, the request queue 40 includes a plurality 
.25 of queue: entries 46, that is, Ql, Q2, Q3, Q4, Q5, Q6, Q7, Q8, 
Q9, Q10, ;Q11, Q12, and the latency monitoring unit 42 includes 
a plurality of monitoring registers 48, that is, Ml, M2, M3, 
M4, M5, M6, M7, M8, M9, M10, Mil, M12 corresponding to the 
queue entries Ql, Q2,Q3, Q4, Q5, Q6, Ql, Q8, Q9, Q10, Qll, 

30 Q12 respectively . Each of the monitoring registers Ml, M2, 
M3, M4, M5, M6, M7 , M8, M9, M10, Mil, M12 is used to record 
a latency value of the corresponding queue entry. The queue 



entry Qi is a leading entry with a highest priority, while- 
the queue entry Q12 is a tail entry corresponding to a lowest 
priority. In other words, the request stored in the queue- 
entry Ql; will be executed first. 

5 

When access requests issued by a master device are 
received by the memory controller 30, the access requests are 
inputted into the request queue 40. Each of the access 
requests includes address information, read/write 

10 information, length information,, and some necessary flags 
(request priority values and write-back indicators for 
example);. It is noted that the master device mentioned above 
can -be the CPU 12, the display controller 18, the hard-disk 
drive .24, or the I/O device 22 shown in Fig.l. The page/bank 

15 comparing unit 32 identifies ' pages expected to be accessed 
by the received access requests, and informs the reorder 
decision-making unit 36 of the page information. The latency 
control unit 34 detects a signal from the latency monitoring 
unit 42 to determine whether latency values stored in the 

20 monitoring registers 4 8 has reached a predetermined - limit 
such as I a maximum allowance; value. In addition, if the 
monitoring register 48 stores the maximum allowance value, 
the latency control unit 34 will block the reorder 

decision-making unit 36 from delaying an access request kept 

i - . 

i . • ;• 

25 in a queue entry corresponding to the monitoring register- 48. 
The reorder decision-making, unit 36 receives information 
outputteld from the page/bank comparing unit 32 and the latency 
control unit 34 to. determine positions of the access requests 
in the request queue 40. That is, the page/bank comparing unit - 

30 32 is capable of identifying currently opened pages, and gives 
Information about recommendatory positions of the received 
access requests in the request queue 40 to the reorder 



decision-making unit 36. Then, the reorder decision-making 
unit 36 sends a command to the request queue selection unit 
38for adjusting positions of access requests in the request 
queue 40. As such, the access requests are reordered to be 
5 stored in different queue entries 46 so that the access 
requests' have new positions in the request queue 40. The 
priorities of the access requests are then adjusted. The 
memory access state machine 44 then sequentially accesses the 
memory device 20 according to the access requests stored in 
10 the queue entries 46. 

Fig. 5 is a flow chart showing the operation of an access ' 
request reordering process according to the present invention. 
Detailed operation of the reorder process is described as 
15- follows. 1 The memory controller 30 receives a first access 
request. The page/bank comparing unit 32 identifies that the 
first access request is to access a first page within the 
memory device 40 (step 100) . The memory controller 26 then 
checks whether the request queue 40 is empty. If there is 

20 currently no access request in the request queue 40, the 

! 

reorder decision-making unit 36 does not need to perform the 

i 

reordering process, and the received first access request is 

just pushed into the request queue 40. In addition, the first 

I 

access request is kept in the queue entry Ql with the highest 

■ i 

25. priority, and an initial value (0 for example) is assigned 
to the corresponding latency value stored in the monitoring 
register; Ml. On the other hand, if the request queue 40 
contains ; an access request, in step 102, it is determined 
whether t:here is any access request in the request queue 40 

30 for accessing the first page. If no access request in the. 

i ' . 

request iqueue is to access the first page, the reorder 
decision-making unit 36 does not need to perform the 

i _ . . . 

j • 



■i r 

reordering process, and the received first access request is 
just pushed into the request queue 40 to have a lowest priority 
within the access requests in the request queue 40. If, in 
step 102, the page/bank comparing unit 32 finds out that there 
5 is a second access request in the request queue 40 used to 
access the first page the reorder decision-making unit 36 
determines whether the second access request is the last one 
or has the lowest priority (step 106). If the second access 

! 

request has the lowest priority, the reorder decision-making 

t * 

10 unit 36 does not need to perform the reordering process, and 
the received first access request is just pushed into the 
request cjueue 40 (step 104) . In other words, the'stored first 

access request then has the lowest priority. If the second 

i 

access request does not have the lowest priority in the queue, 
15 the step 108 is performed to check whether the reorder 
decision-making unit 36 should begin reordering the access 
requests. That is, if a latency value of a third access request 
following the second access request is going to be increased 
by an incremental value caused by executing the first access 
20 request jbefore the third access request and such increased 

value is'greater than a predetermined maximum allowance value, 

i 

the reorder decision-making unit 36 can not start the 
reordering process because too much delay will be introduced 

before executing the third access request. The first access 

i " ■ 

25 request Is then pushed into the request queue 40 as normal. 

Back in ' step 108, if the maximum allowance value is not 

i 
i 

breached, the reorder decision-making unit 36 instructs the 
request queue selection unit 38 to insert the first access 
request between the second access request and the third access 
30 request (step 110). In addition, the latency values of the 
access requests starting- from the third access request are 
increased accordingly due to the insertion of the first access 



1 A 



request (step 112). It is noted that the first request and 
the second request are intended to access the same first page 
within the memory device 40. It is further noted that the 
second request and the third request have already been stored 
in the request queue 40, and the second request has. a queue 
priority greater than that of the third request. The initial 
value is l assigned to the latency value stored in a monitoring 
unit 48 corresponding to a queue entry 46 associated with the 

first access request. 

j 

i ' ••. ■ ■ - 

The main objective for having the maximum allowance value 

is to prevent the latency of the host bus from being seriously 

impacted. Newly received access request accessing a 

predetermined page is annexed to those access requests 

accessing the same predetermined page for reducing the time 

spent on; switching between different pages. However, if the 

insertion of the received access request makes the delayed 

access requests have latency values greater than the maximum 

i 

allowance value, the reordering process is skipped. The 
latency jof the host bus is thus not going to .be unduly 
prolonged, and the overall performance of the data access 
operation is not compromised. 

i 

■ i 

In order to better describe features of the present 
invention, an example is introduced. Suppose that the maximum 
allowance value is 2 and the incremental value caused by the 

inserted access request, for latency value calculation 

■ j 

purpose,' is 1 for the access request behind the inserted 

! • ' ■ 

access request. It is noted that values other than 2 and. 1 

i 

can be ^assigned to the maximum allowance value and the 

i ' 
incremental value respectively depending on the design of the 

computer system as they are fully programmable. It is assumed 



that a master device (CPU 12 for. example) sequentially 
produces read requests RA1, RA2, RBI, RB2 , RA3, RA4 , RB3, RB4 , 
RA5 , RA6'for acquiring data DAI , DA2 , DBl f DB2 , DA3, DA4 , DB3, 
DB4, DAS, DA 6 stored in the memory device 20. It is noted that 
5 data DAi, DA2, DA3, DA4 , DA5, DA6 are all stored on page A 
of the memory device 20, and data DB1, DB2, DB3, DB4 are all 
stored on page B of the memory device 20. It is understood 
for the purpose of this disclosure, reordering operation for 
a read request is used as an example of the access request, 
10 but the same concept can apply to a write request as well. 

The request queue for the access requests, queue entries, and 

i ■* * 

the monitoring registers (collectively "reordering 

parameters") are listed as. follows: 

! • . 

15 Read requests: {RA1, RA2, RBI, RB2 , RA3, RA4 , RB3, RB4 , RA5, 

RA6 } 

Queue entries: {Ql, Q2, Q3, Q4, Q5, Q6, Q7, Q8, Q9, Q10, Qll, 

Qi2} | : 

Monitoring registers^ {Ml, M2, M3, M4, M5, M6, M7, M8, M9, 
20 M10, Mil, M12} 

As there is no read request stored in the request queue 

4 6 when RA1 is processed, the read request RA1 is pushed into 

the request queue 4 6 (steps 101,. 104), and is stored by the 

25 queue entry Ql. In addition, an initial value (0 for example) 
i ■ . 

is assigned to the monitoring register Ml corresponding to 

the queue entry Ql. The reordering parameters have now become: 

I 

Read requests: { RA2 , RBI, RB2, RA3, RA4, RB3, RB4, RA5, RA6, 
30. RB5 , RB6 } 

Queue entries: {Ql (RA1 ) , Q2, Q3, Q4, Q5, Q6, Q7 , Q8, Q9, Q10, 
Qll, Q12} 



Monitoring registers: {Ml (0), M2, M3, M4, M5, M6, M7, M8 , 
M9, M10, Mil/ M12} 

Although the read requests RA1 , RA2 both retrieve data 
5 stored oh the same page, the read request RA1 in the request 
queue 40 is the only and last one, the- read request RA2, 
therefore, is also pushed into the request queue. 40 (steps 
• • 106, 104) and is stored in the queue entry Q2 . In addition, 
the initial value is .assigned to the monitoring register M2 
10 corresponding to the queue entry Q2 . The reordering 
parameters now look like: 

Read requests: {RBI, RB2, RA3, RA4 , RB3 , RB4 , RA5, RA6, RB5, 
RB6 } • ; . ■ 

15" Queue entries: {Ql (RA1) , Q2 (RA2) , Q3, Q4, Q5, Q6, Q7, Q8, 
Q9, Q10, Qil, Q12} 

Monitoring registers:" {Ml. (0) , M2 (0), M3, M4, M5, M6, M7, 
M8, M9, M10, Mil, M12} 

20 * Now, the read requests RA2, RBI retrieve data stored on 

different pages A and B. The read request RB2, therefore, is 
just pushed into the request queue 40 (steps 102, 104) and 
is stored in the queue entry Q3. In addition, the initial value 
is assigned to the monitoring register M3 corresponding to 

25 the queue entry Q3. The reordering parameters now look like: 

Read requests: { RB2 , RA3, RA4, RB3, RB4, RA5, RAG, RB5, RB6 } 
■ Queue entries: {Ql (RA1) , Q2 (RA2), Q3 (RBI) , Q4, Q5, Q6, Q7, 
Q8, Q9, Q10, Qll, Q12 } 
30 Monitoring registers: {Ml (0), M2 (0), M3 (0), M4, M5, M6, 
M7, M8, M9, M10, Mil, M12} 



The read requests RBI, RB2 now both retrieve data stored 
on the same page B. However, the read request RBI is the last 
one in the request queue 40. The read request RA2, therefore, 
is just pushed into the request queue 40 (steps 106, 104) and 
5 is stored in the queue entry Q4 . In addition, the initial value 
is assigned to the monitoring register M4 corresponding to 
the queue entry Q4 . The reordering parameters now have become: 

i ■ "... 

Read requests: {RA3, RA4, RB3, RB4, RA5, RA6, RB5 , RB6 } 
10 Queue entries: {Ql (RA1) , Q2 (RA2) , Q3 (RBI), Q4 (RB2) , Q5, 
Q6, Q7, Q8, Q9., Q10, Qll, Q12} 

Monitoring registers: {Ml (0) , M2 (0), M3 (0) , M4 (0) , M5, 
M6, M7, M8, M9, M10, Mil, M12} 

i " - ' 

15 The read requests RA2, RA3 demand required data stored on 

the same 1 page A (step 102) . In addition, the read request RA2 
is not the last request in the request queue 40 (step 104). 
It is determined that the latency value of the monitoring 

register! M3 increased by the incremental value is not greater 

■'i • 

! 

20 than the' maximum allowance value (step 108) . Therefore, the 

i 

read requests RBI, RB2 are rearranged by positioning the read 

I ■ ■ ■ ■ 

request RA3 after Q2(step 110) . The queue entry Q3 now stores 

i 

the read: request ra3 instead, and the read requests RBI, RB2 
are delayed, and stored by queue entries Q4 and Q5 
25 respectively. Because the read requests RBI, RB2 are delayed, 
the related latency values are incremented accordingly. In 

i 

addition, the initial value is assigned to the monitoring 
register M3 corresponding to the queue entry Q3. The resulting 
parameters are listed as follows. 

30 

Read requests: { RA4 , RB3, RB4, RA5, RA6, RB5, RB6 } 

Queue entries: {Ql (RA1) , Q2 (RA2) , Q3 (RA3) , Q4 (RBI), Q5 

i 
i 
i 

! • 

I ■ ■ . 

i 



(RB2), Q6, Q7, Q8, Q9, Q10, Qll, Q12 } 

Monitoring registers: {Ml (0), M2 (0), M3.(0), M4 (1), M5 (1), 
M6, M7, M8, M9, M10, Mil, M12} 



5 The read request RA4 accesses data stored on the same page 

A as RA3(step 102) . In addition, the read request RA3 is not 
the last request in the request queue 40 (step 104). It is 
also obvious that the latency value of the monitoring register 
M4 after the adjustment is not greater than the maximum 
10 allowance value (step 108) . Therefore, the read requests RBI, 
RB2 are further pushed back due to the insertion of the read 
request RA4 (step 110) . The queue entry Q4 now stores the read 
request ' RA4 instead, and the read requests RBI, RB2 are 

delayed to be stored by queue entries Q5 and Q6 respectively. 

j 

15 Because 'the read requests RBI, RB2 are delayed, the related 
latency Values are adjusted again by the incremental value. 

i 

In addition, the initial value is assigned to the monitoring 
register M4 corresponding to the queue entry Q4 . The 
reordering parameters are now listed as follows. 
20 j 

I 

Read requests: { RB3, RB4, RA5, RA6, RB5, RB6 } 

. ' I 

Queue entries: {Ql (RA1), Q2 (RA2) , Q3 (RA3) , Q4 (RA4), Q5 

i 

(RBI), Q6 (RB2), Q7 , Q8, Q9, Q10, Qll, Q12} 

I ' ■ . 

Monitoring registers: {Ml (0) , M2 (0) , M3 (0) , M4 (0) , M5 (2) , M6 

25 (2) , M7, M.8, M9, M10, Mil, M12} 

j 

i 

! 

Now, , the read requests RB2, RB3 both intend to retrieve 

data stored on the same page B. However, the read request RB2 

I 

in the request queue 40 is the last one. The read request RB3, 
30 therefore, is pushed into the request queue 40 (steps 106, 
104) and is stored in the queue entry Q7 . In addition, the 
initial ] value is assigned to the monitoring register M7 



corresponding to the queue entry Q7 . The parameters are as 
-follows: 



Read requests: { RB4 , RA5, RA6, RB5, RB6 } 
5 Queue entries: {Ql (RAl) , Q2 (RA2) , Q3 (RA3) , Q4 ( RA4 ) , Q5 
(RBI), Q6 (RB2), Q7 (RB3) , Q8, Q9, Q10, Qll, Q12} 
Monitoring registers: {M1(0), M2 ( 0 ) , M3 ( 0 ) , M4 ( 0) , M5(2), M6 
(2) , M7(0), M8, M9, M10, Mil, M12} 

10 Similarly, the read requests RB3, RB4 both retrieve data 

stored on the same page B, and the read request RB3 in the 
request j queue 40 is the last one. The read request RB4, 
therefore, is also pushed into the request quesue 40 (steps 
106, 104:) and is stored in the queue entry Q8 . In addition, 

15 the initial value is assigned to the monitoring agister M8 ; - 
corresponding to the queue entry Q8 . The parameters are shown 
as follows: 

i , . 

Read requests: {RA5, RA6, RB5, RB6 } 

20 Queue entries: {Ql (RAl) , Q2 (RA2) , Q3 (RA3) , Q4 ( RA4 ) , Q5 

(RBI), Q6 (RB2), Q7 (RB3) , Q8 (RB4), Q9, Q10, Qll, Q12} 

Monitoring registers-: (M1(0), M2 (0) , M3(0), M4(0), M5(2), M6 

j ' 

(2), M7(0), M8(0), M9, M10, Mil, M12 } 

j ' 

25 The read requests RA4 and RA5 access data stored on the 

same page A (step 102), and the read request RA4 is not the 
last request in the request queue 40 (step 106) . If the read 
request ; RB1 next to the read request RA4 is delayed again, 
the corresponding latency value increased by the incremental 

30 value will become 3, and will exceed the maximum allowance 
value (step 108) . Therefore, the read requests in the request 
queue 40 are not reordered. The read request RA5 , therefore, 



is pushed into the request queue 40 (step 104) and is stored 
in the queue entry Q9 . In addition, the initial value is 
assigned to the monitoring register M9 corresponding to the 
queue entry Q9. The parameters are shown as follows: 

5 

Read requests: { RA6, RB5 f RB6 } 
. Queue entries: {Ql (RAl) , Q2 (RA2) , Q.3 (RA3) > Q4 (RA4 ) , Q5 
(RBI) , Q6 ( RB2 ) , Ql (RB3) , Q8 (RB4 ) , Q9 (RA5) , Q10, Qll, Q12} 
Monitoring registers: {Ml (0) , M2 (0) , M3 (0) > M4(0), M5(2), M6 
10 (2), M7(0), M8(0), M9(0), M10, Mil, M12} 

The read requests RA5, RA6 both retrieve data stored on 

the same, page A. However, the read request RA5 in the request 

i 

queue 40, is the last one. The read request RA6, therefore, 
.15 is also pushed into the request queue 40 (steps 106, 104) and 
is stored in the queue entry Q10. In addition, the initial 
value is assigned to the monitoring register M10 
corresponding to the queue entry Q10. The parameters are shown 
as follows : . 

20 ! 

i 

Read requests: { RB5, RB6 } 

Queue entries: {Ql (RAl), Q2 (RA2) , Q3 (RA3) , Q4 (RA4), Q5 
s (RBI), Q6 (RB2), Q7 (RB3) , Q8 (RB4), Q9 (RA5) , Q10 (RA6) , Qll, 
Q12} |. ■ , '■ 

25 Monitoring registers: (M1(0), M2(0), M3(0), M4 (0) , M5(2), M6 
(2), M7(0), M8(0), M9(0), M10(0), Mil, M12} 

i 

! 

i - 
The read request RB5, as RB4, now intends to access data 
stored on the same page B (step 102) . In addition, the read 
30 request RB4 is not the last request in the request queue 40 

(step 104) . The latency value of the monitoring register M9 

j ■ 

is still 0, and will become 1 after additional delay, and will 
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not be greater than the maximum allowance value (step 108) . 
Therefore, the read requests RA5, RA6 are reordered due to 
the insertion of the read request RB5 (step 110) . The queue 
entry Q9 stores the read request RB5 instead, and the read 
5 requests RA5, RA6 are delayed so that the queue entries Q10, 
Qll hold.the- read requests RA5, RA6 respectively. Because the 
read requests RA5, RA6 are delayed, the related latency values 
are adjusted- In addition, the initial value is assigned to 
the monitoring register M9 corresponding to the queue entry 
10 Q9 holding the inserted read request RB5. The resulted 
reordering parameters are as follows: 

Read requests: {RB6} 

Queue entries: '{Ql (RA1) , Q2 (RA2) , Q3 (RA3) , Q4 (RA4), Q5 
• 15 (RBI) , Q6 (RB2), Q7 (RB3) , Q8 (RB4), Q9 (RB5) , Q10 (RA5) , Qll 
(RA6) , Q12} 

Monitoring registers: {M1(0), M2{0), M3(0), M4,(0) , M5 (2) , M6 
(2), M7(0), M8(0), M9(0), M10(l), Mll(l), M12} 

20 The read request RB6, as RB5, intends to access data' stored, 

on the same page B (step 102). In addition, the read request 
. ' RB5 is nqt the last request in the request queue 40 (step 104) . 

It is obvious that the latency value of the monitoring 
register M10 after adjustment is not greater than the maximum 
25 allowance value (step 108) . Therefore, the read requests RA5, 
RA6 are ! reordered again due to the insertion of the read 
request RB6 (step 110) . The queue entry Q10 stores the read 
request :RB6 instead, and the read requests RA5, RA6 are 
. further delayed so that the queue entries Qll, Q12 hold the 
'30 read, requests RA5, RA6 respectively. Because the read 
requests: RA5, RA6 are delayed, the related latency vaiues are 
adjusted again. In addition, the initial value is assigned 

i - • 
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to the monitoring register M10 corresponding to the queue 
entry. Q10 holding the inserted read request RB6. The final 
reordering parameters are shown as .follows: 

5 Read requests: {} 

Queue entries: {Ql (RA1) , Q2 (RA2), Q3 (RA3) , Q4 (RA4), Q5 

(RBI) , Q6 (RB2), Q7 (RB3) , Q8 ( Rfi 4 ). , Q9 (RB5) , Q10 (RB6) , Qll 

i ' 

(RA5), Q12 (RA6) } 

Monitori'ng registers: {Ml (0) , M2 (0) , M3(0), M4 (0) , M5(2), M6 
10 (2), M7(0), M8{0), M9(0), M10(0), Mll(2) r M12(2)} 

Fig. 6 is a diagram showing data access operation of. the 
memory device 20 after the above described reordering process 
is done J As described above, the final request sequence 

15 becomes RAl, RA2, RA3, RA4, RBI, RB2, RB3, RB4, RB5, RB6, RA5, 
and RA6,i and has been changed from the original .request 
sequence: RA1, RA2, RBI, RB2, RA3, RA4, RB3, RB4, RA5, and 
RA6. As shown in Fig. 6, two page switch operations caused by 
two page misses introduce two latencies LI for the memory bus, 

20 and only one latency L2 for the host bus. Compared with Fig. 2, 
Fig. 6 shows that the reordering of the read requests as 
described above saves significant time spent on switching 
pages. In addition, utilization of the host bus is more 
efficient. It is also shown that the data access operation 

25 requires; a shorter period of time to complete delivering 
required data back to the master device. 

In addition,-- the claimed memory controller 30 is located 
within ai north bridge circuit. This north bridge circuit, 
30 therefore, is capable of reordering access requests and is 
denoted by a reorder-enabled north bridge circuit. 

■ i 



Comparing with Fig. 2, Fig. 6 also shows that latency L2 is 
greatly reduced. The overall performance of the data access, 
operation is thus also greatly improved due to a better 
utilization of the host bus. A maximum allowance value is used 
5 in this disclosure to prevent the performance of the host bus 
from being worsened when reducing the time spent on switching 
pages. As clearly shown in- Fig. 6, an ending of the memory bus 

activity is close to that of the host bus activity. A 

j 

compromise of the memory bus utilization and the host bus 
10 .utilization is achieved to optimize performance of data 

retrieval. It is noted that the maximum allowance value is 

j 

programmable to meet requirements of different data 
processing systems for appropriately balancing performance 
of the memory bus and performance of the host bus. In other 
15 words, optimal performance of the memory bus and the host bus 

i 

is obtained with the help of the appropriately programmed 
maximum allowance value. 

The^access requests accessing the same page are not only 
20 simply grouped, but also grouped to the extent practical so 

i 

as not to violate the set maximum allowance value. That is, 

t 

if an insertion of a newly received access request seriously 

i 

delays previously queued access requests, the newly received 

j 

access request is just pushed into the request queue as usual 

! 

25 without ^invoking the reordering process. By closely 

i 

monitoring the latency values of the previously queued access 

I 

requests, the performance of the host bus will not be greatly 
deteriorated. As a consequence, the performance of the memory 
bus and the performance of the host bus are balanced to 
30 optimize the overall performance of the data processing 
system. 



Those skilled in the art will readily observe that numerous 
modifications and alterations of the device and the method 
described may be made while retaining the teachings of the 
invention. 



